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Abstract 

We use Stein's method to obtain bounds on the rate of convergence for 
a class of statistics in geometric probabiUty obtained as a sum of contri- 
butions from Poisson points which are exponentially stabilizing, i.e. locally 
determined in a certain sense. Examples include statistics such as total edge 
length and total number of edges of graphs in computational geometry and 
the total number of particles accepted in random sequential packing models. 
These rates also apply to the 1-dimensional marginals of the random measures 
associated with these statistics. 
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1 Introduction 



In the study of limit theorems for functionals on Poisson or binomial spatial point 
processes, the notion of stabilization has recently proved to be a useful unifying 
concept [4, 11, 13]. Laws of large numbers and central limit theorems can be proved 
in the general setting of functionals satisfying an abstract 'stabilization' property 
whereby the insertion of a point into a Poisson process has only a local effect in some 
sense. These results can then be applied to deduce limit laws for a great variety of 
particular functionals, including those concerned with the minimal spanning tree, 
the nearest neighbor graph, Voronoi and Delaunay graphs, packing, and germ-grain 
models. 

Several different techniques are available for proving general central limit the- 
orems for stabilizing functionals. These include a martingale approach [11] and a 
method of moments [4]. In the present work, we revisit a third technique for proving 
central limit theorems for stabilizing functionals on Poisson point processes, which 
was introduced by Avram and Bertsimas [1]. This method is based on the normal 
approximation of sums of random variables which are 'mostly independent of one 
another' in a sense made precise via dependency graphs, which in turn is proved 
via Stein's method [14] . It has the advantage of providing explicit error bounds and 
rates of convergence. 

We extend the work of Avram and Bertsimas in several directions. First, whereas 
in [1] attention was restricted to certain particular functionals, here we derive a gen- 
eral result holding for arbitrary functionals satisfying a stabilization condition which 
can then be checked rather easily for many special cases. Second, we consider non- 
uniform point process intensities and do not require the functionals to be translation 
invariant. Third, wc improve on the rates of convergence in [1] by making use of the 
recent refinement by Chen and Shao [7] of previous normal approximation results 
for sums of 'mostly independent' variables. Finally, wc apply the methods not only 
to random variables obtained by summing some quantity over Poisson points, but 
to the associated random point measures, thereby recovering many of the results of 
Baryshnikov and Yukich [4] on convergence of these measures, and without requiring 
higher order moment calculations. We add to [4] by providing information about 
the rate of convergence, and relaxing the continuity conditions required in [4] for 
test functions and point process intensities. 

A brief comparison between the methods of deriving central limit theorems for 
functionals of spatial point processes is warranted. Only the dependency graph 
method used here, to date, has yielded error bounds and rates of convergence. On 
the other hand, our method requires bounds on the tail of the 'radius of stabilization' 
(i.e., on the range of the local effect of an inserted point). The martingale method, 
in contrast, requires only that this radius be almost surely finite, and for this reason 
is applicable to some examples such as those concerned with the minimal spanning 
tree, for which no tail bounds are known and which therefore lie beyond the scope of 
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the present work. The moment method [4] and martingale method [10], unhke the 
dependency graph method, provide information about the variance of the Gaussian 
hmits. The moment method has also been used [5] to establish moderate scale limit 
behavior of functionals of spatial point processes. Whereas the moment method 
requires exponential tail bounds for the radius of stabilization, one of our central 
hmit theorems (Theorem 2.2) requires only that this tail T{t) decay as a (large) 
negative power of t. 

With regard to case of use in applications, the dependency graph method and 
method of moments require checking tail bounds for the radius of stabilization, 
which is usually straightforward where possible at all. The method of moments 
requires a more complicated (though checkable) version of the bounded moments 
condition (2.5) below (see [4]). The dependency graph method requires some sepa- 
rate calculation of variances if one wishes to identify explicitly the variance of the 
limiting normal variable. The martingale method requires the checking of slightly 
more subtle versions of the stabihzation conditions needed here [10, 11]. 

2 General results 

Let d > 1 be an integer. For the sake of generality, we consider marked point 
processes in M.'^. Let {Ai, J-'m,'^m) be a probability space (the mark space). Let 
^{{x, s); X) be a measurable M-valued function defined for all pairs ((x, s), X), where 
X d X M. IS finite and where {x,s) & X {so x & and s G M). When 
{x,s) e (M*^ X M) \ X, we abbreviate notation and write ^{{x,s)]X) instead of 
a{x,slXU{{x,s)}). 

Given X d x M, a > Q d.nA y e , we let aX := {{ax,t) : {x,t) E X} 
and y + X := {{y + x,t) : {x,t) G X}; in other words, translation and scalar 
multiplication on X Ai act only on the first component. For all A > let 

Ui^, s); X) := s);x + X'/\-x + X)). 

We say ^ is translation invariant if ^{{y + x,s);y + X) = ^{{x, s); X) for all 
y G W^, all {x,s) eW^ X M and all finite X G x M. When ^ is translation 
invariant, the functional simplifies to ^x{{x, s); X) = ^{{X^^'^x, s); X^^'^X). 

Let K be a probability density function on M.'^ with compact support A C M.'^. For 
all A > 0, let Vx denote a Poisson point process in M"^ x with intensity measure 
{XK{x)dx) X F_f^{ds). We shall assume throughout that k is bounded with supremum 
denoted ||k||oo- 

Let {Ax, A > 1) be a family of Borel subsets of A. The simplest case, with 
Ax = A for all A, covers all examples considered here; we envisage possibly using 
the general case in future work. 

The following notion of exponential stabilization, adapted from [4] , plays a cen- 
tral role in all that follows. For x eW' and r > 0, let Br{x) denote the Euclidean ball 
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centered at x of radius r. Let U denote a random element of M. with distribution 
Px, independent of V\. 

Definition 2.1 ^ is exponentially stabilizing with respect to k and (^a)a>i if for 
all \ > 1 and all x G Ax, there exists an a.s. finite random variable R := R{x, A) (a 
radius of stabilization for ^ at x) such that for all finite X <Z {A \ Bx-i/dji{x)) x Ai, 
we have 

{{x, U); [Px n (Bx-i/anix) x M)] U A') = {{x, U);Vx n {Bx-./<iR{x) x M)) , 

(2.1) 

and moreover the tail probability r{t) defined for t > by 

T{t) := sup P[R{x, A) > t] (2.2) 

A>1, xeAx 

satisfies 

lim sup t~^ log T(i) < 0. (2.3) 

t— >oo 

For > 0, we say ^ is polynomially stabilizing of order 7 if the above conditions 
hold with (2.3) replaced by the condition lim sup^^^ i'''r(t) < 00. 

Condition (2.1) may be cast in a more transparent form as follows. Each point 
of A" is a pair (x, U), with x E M.^ and U E Ai, but for notational convenience we 
can view it as a point x in R*^ carrying a mark U :— U^- Then we can view X as 
a point set in with each point carrying a mark in M.. With this interpretation, 
(2.1) stipulates that for all finite (marked) A" C ^ \ B^^-i/dji^x)., we have 

ix {x- {Vx n Bx-i/dRix)) UX)^Cx (x; Vx n Bx-vdR{x)) . (2.4) 

Roughly speaking, R := R{x, A) is a radius of stabilization if the value of Cx{x', Vx) 
is unaffected by changes to the points outside Bx-i/dji{x). 

Functionals of spatial point processes often satisfy exponential stabilization (2.1) 
(or (2.4)); here is an example. Suppose M. = [0, 1] and Fm is the uniform distribu- 
tion on [0, 1]. Suppose that A is convex or polyhedral, and k, is bounded away from 
zero on A. Suppose a measurable function {q{x),x e A) is specified, taking values 
in [0, 1] . Adopting the conventions of the preceding paragraph, for a marked point 
set X G 'R'^ let us denote each point x E X as 'red' if Ux < q(x) and as 'green' if 
Ux > q{x). Let ^{x\ X) take the value if the nearest neighbor of a; in A* has the 
same color as x, and take the value 1 otherwise. Note that ^ is not translation invari- 
ant in this example, unless q{-) is constant. For x E Alei R :— R{x, A) denote the 
distance between X^^^x and its nearest neighbor in X^^'^Vx- Then stabihzation (2.4) 
holds because points lying outside Bx-i/dR{x) will not change the value of ^xix] Vx), 
and it is easy to see that R has exponentially decaying tails. This example is rele- 
vant to the multivariate two-sample test described by Henze [8]. See Section 3 for 
further examples. 
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Definition 2.2 ^ has a moment of order p > (with respect to k and {Ax)x>i) if 

sup E[\U{x,Uy,Vx)\P]<oo. (2.5) 

A>1, xeAx 

For A > 0, define the random weighted point measure on M*^ by 

(x,s)eVxn{AxxM) 

and the centered version := — E [fi\]. 

Let -B(A) denote the set of bounded Borei-measurable functions on A. Given 
/ e B{A), let L/d^ and (/,7li) := J^/d/^. 

Let $ denote the distribution function of the standard normaL Our main result 
is a normal approximation result for (/, /I^), suitably scaled. 

Theorem 2.1 Suppose \\k\\oo < oo- Suppose that ^ is exponentially stabilizing and 
satisfies the moments condition (2.5) for some p > 2. Let q e (2, 3] with q < p- Let 
f G B(A) and put Tx := (/, /U^). Then there exists a finite constant C depending on 
d, ^, K, p, q and f , such that for all \>2, 

< C(logA)'^«A(VarTA) (2.6) 

Separate arguments are required to establish the asymptotic behavior of the 
denominator (Var(Tx))^/^ in (2.6). When Ax — A for all A, it is typically the case 
for polynomially stabilizing functionals satisfying moments conditions along the lines 
of (2.5) that there is a constant (7^(/, ^, k) > such that 

hm A-^Var(/,4) = a2(/,e,«:). (2.7) 

A— >oo 

For further information about a^{f,^,K) and precise conditions under which (2.7) 
holds, see Theorem 2.4(i) of [4]. When (2.7) holds, by combining it with Theorem 
2.1 we obtain 

{f,X-'^%)^^f{0,a\f,L^)), (2.8) 

where A/'(0, cr^) denotes a centered normal distribution with variance if > 0, 
and a unit point mass at if cr^ = 0. 

In many applications (2.7) holds with (T^(/, ^, k) > 0, showing that the case q — 3 
of (2.6) yields a rate of convergence 0((log A)^'^A~''^/^) to the normal distribution. In 
other words, we will make frequent use of: 



sup 

teiR 



Tx-ETx 



(VarTx) 



< t 



1/2 - 



-m 
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Corollary 2.1 Suppose ||k||oo < oo- Suppose that ^ is exponentially stabilizing and 

satisfies the moments condition (2.5) for some p > 3. Let f G B{A) and put 
Tx := // (2.7) holds with a^{f,^,K.) > 0, then there exists a finite constant 

C depending on d, C,, n, p and f , such that for all X> 2, 



sup 



Tx-ET: 



A 



(VarrA)V2 



< t 



< C(logA)3'^A-^/2. 



Our methods actually yield normal approximation and a central limit theorem 
when the exponential decay condition is replaced by a polynomial decay condition 
of sufficiently high order. We give a further result along these lines. 

Theorem 2.2 Suppose \\k\\oo < oo. Suppose for some p > 3 that ^ is polynomially 

stabilizing of order 7 with 7 > d{150 + Q/p), and satisfies the moments condition 
(2.5). Let f G B[A) and put Tx '■= (/, yU^). Suppose that (2.7) holds for some 
cr^ > 0. Then (2.8) holds and if := cr^(/, > there exists a finite constant 
C depending on d, ^, n, p and f , such that for all \ >2, 



sup 



Ta-ETa 
(VarTA)V2 



< t 



< (7;^(150pci+6<i-p7)/2(p7-6d) 



(2.9) 



Remcirks 



1. Our results are stated for marked Poisson point processes, i.e., for Poisson 
processes inW^xM. where A4 is the mark space. These results are reduced to 
the corresponding results for unmarked Poisson point processes in by taking 
Ai to have a single element (denoted m, say) and identifying x Ai with R"' 
in the obvious way by identifying {x, m) with x for each x G M"*. In this case the 
notation (2.4) is particularly appropriate. Other treatments such as [4, 10, 11, 
12] tend to concentrate on the unmarked case with commentary that the proofs 
carry through to the marked case; here we spell out the results and proofs in 
the more general marked case, which seems worthwhile since examples such as 
those in Section 3.3 use the results for marked point processes. Our examples 
in Sections 3.1, 3.2, and 3.4 refer to unmarked point processes and in these 
examples we identify x {m} with R*^ as indicated above (so that Vx is 
viewed as a Poisson process in 



2. We are not sure if the logarithmic factors can be removed in Theorem 2.1 or 
Corollary 2.1. Avram and Bertsimas [1] obtain a rate of 0((log A)^''"^/^^''^"-^/^), 
for the length of the A;-nearest neighbors (directed) graph, the Voronoi graph, 
and the Delaunay graph (see Sections 3.1 and 3.2). Our method for general 
stabilizing functionals is based on theirs, but uses a stronger general normal 
approximation result (Lemma 4.1 below). 
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3. If (2.7) holds with a^(f,^,K) = 0, then (2.8) holds trivially by Chebyshev's 
inequality, but Theorem 2.1 does not provide any useful information on rate 
of convergence. In examples of interest, it can usually be established that 
'^^{fi^ii^) > 0) by further separate arguments. We do not discuss these in 
detail here but refer the reader to [1, 11, 4]. 

4. Theorems 2.1, 2.2, and Corollary 2.1 require neither the underlying density 
function n nor the test function / to be continuous (both of these conditions 
are imposed in [4]). In particular, these three results apply when / is the 
indicator function of a Borel subset B of A, giving normal approximation for 

5. We do not have rate of convergence results in the binomial (non-Poisson) 
setting. For central limit theorems in the binomial setting, we refer to [11] 
and [4], which treat uniform and non-uniform samples respectively. 

6. Some functionals, such as those defined in terms of the minimal spanning tree, 
stabilize without any known bounds on the rate of decay of the tail probability 
r(t). In these cases univariate and multivariate central limit theorems hold 
[10, 11] but our Theorems 2.1 and 2.2 do not apply and explicit rates of 
convergence are not known. 

3 Applications 

Applications of Corollary 2.1 to geometric probability include functionals of proxim- 
ity graphs, germ-grain models, and random sequential packing models. The follow- 
ing examples are for illustrative purposes only and are not meant to be encyclopedic. 
For simplicity we will assume that M*^ is equipped with the usual Euclidean met- 
ric. While translation invariance is not needed in the general results in Section 2, 
most of the examples treated in this section involve translation invariant functionals 
^. However, the examples can be modified to treat the (non-translation- invariant) 
situation where has a local metric structure. 

3.1 /c-nearest neighbors graph 

Let A; be a positive integer. Given a locally finite point set X C M*^, the A;-nearest 
neighbors (undirected) graph on X, denoted kNG{X), is the graph with vertex 
set X obtained by including {x, y} as an edge whenever y is one of the k nearest 
neighbors of x and/or x is one of the k nearest neighbors of y. The /c-nearest 
neighbors (directed) graph on X, denoted kNG'{X), is the graph with vertex set X 
obtained by placing a directed edge between each point and its k nearest neighbors. 

Let N^{X) denote the total edge length of the (undirected) A;-nearest neighbors 
graph on X. Note that N''{X) = J2x&x i^'^ ^ where ^''{x;X) denotes half the 
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sum of the edge lengths in kNG{X) incident to x. If A is convex or polyhedral and 
K is bounded away from on A, then is exponentially stabilizing (cf. Lemma 6.1 
of [11]) and has moments of all orders. Moreover, as shown in [4] (see e.g. display 
(2.11), Theorem 3.1), at least when / is continuous and Ax = A for all A, 

lim A-Var(/,//i) = T/« I f{xfK{x)^'^-^^/'^dx, 

A-*oo 

where denotes the limiting variance for the total edge length of the fc-nearest 
neighbors graph on \^/'^V\ when k is the uniform distribution on the unit cube. 
Since is strictly positive (Theorem 6.1 of [11]), it follows that (2.7) holds with 
(^'^{f-,i^) ^) > 0- We thus obtain via Corollary 2.1 the following rates in the CLT 
for the total edge length of N^{\^/'^Vx) improving upon Avram and Bertsimas [1] 
and Bickel and Breiman [6]. A similar CLT holds for the total edge length of the 
/c-nearest neighbors directed graph. 

Theorem 3.1 Suppose A is convex or polyhedral and k is bounded away from on 

A. Let Nx '■= N'^^X^/'^Vx) denote the total edge length of the k-nearest neighbors 
graph on X^/'^Vx- There exists a finite constant C depending on c?, and k such 
that 

<C{\ogXf''X-^/^. (3.1) 

Similarly, letting ^^{x; X) be one or zero according to whether the distance be- 
tween X and its nearest neighbor in X is less than s or not, we can verify that 

is exponentially stabilizing and that the variance of ^a^gAi/dp;^ ^^(^^j ''^^^'^^a) is 
bounded below by a positive multiple of A. We thus obtain rates of convergence 
of C>((logA)3'^A-^/2^ in the CLT for the one- dimensional marginals of the empirical 
distribution function of k nearest neighbors distances on X^^'^Vx, improving upon 
those implicit on p. 88 of [9]. 

Using the results from section 6.2 of [11], we could likewise obtain the same rates 
of convergence in the CLT for the number of vertices of fixed degree in the k nearest 
neighbors graph. 

Finally in this section, we re-consider the non-translation-invariant example given 
in Section 2, where a point at x is colored red with probability q{x) and green with 
probability 1 — q{x), and ^(x; X) takes the value if the nearest neighbor of x in 

X has the same color as x, and takes the value 1 otherwise. We can use Corollary 

2.1 to derive a central limit theorem, with 0((log A)^°'A~^''^) rate of convergence, for 
Sccg-Pa /(•^)^(-^' -^a)) where / is a bounded measurable test function. 

3.2 Voronoi and sphere of influence graphs 

We will consider the Voronoi graph for d = 2 and the sphere of influence graph for 
all d. Given a locally finite set X cB? and given x E X, the locus of points closer 



sup 



_(VarArA)V2 



< t 
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to X than to any other point in X is called the Voronoi cell centered at x. The graph 
consisting of all boundaries of Voronoi cells is called the Voronoi graph generated 

The sum of the lengths of the finite edges of the Voronoi graph on X admits 
the representation Ylixex ^i^'i ^) where ^{x\X) denotes one half the sum of the 
lengths of the finite edges in the Voronoi cell at x. If k is bounded away from 
and infinity and A is convex, then geometric arguments show that there is a random 
variable R with exponentially decaying tails such that for any x G Vx^ the value 
of ^{x]Vx) is unaffected by points outside B^-i/dji{x) [4, 11, 13]. In other words, ^ 
is exponentially stabihzing and satisfies the moments condition (2.5) for all p > 1. 
Also, the variance of the total edge length of these graphs on X^^^Vx is bounded 
below by a multiple of A. We thus obtain 0((log A)^'^A~^/^) rates of convergence in 
the CLT for the total edge length functionals of these graphs on X^^'^Vx, thereby 
improving and generalizing the results of Avram and Bertsimas [1]. 

Given a locally finite set X C M^, the sphere of infiuence graph SIG{X) is a 
graph with vertex set X, constructed as follows: for each x E X let be a ball 
around x with radius equal to mmy^x\{x}{\y ~ ^|}- Then is called the sphere of 
influence of x. We put an edge between x and y iff the balls B^ and By overlap. 
The collection of such edges is the sphere of influence graph (SIG) on X. 

The total number of edges of the sphere of infiuence graph on X admits the 
representation J2xex^(^'i where ^(x; X) denotes one half the degree of SIG at the 
vertex x. The number of vertices of fixed degree S admits a similar representation, 
with ^{x; X) now equal to one (respectively, zero) if the degree at x is 5 (respectively, 
if degree at x is not 5). \i k is bounded away from and infinity and A is convex, 
then geometric arguments show that both choices of the functional ^ stabihze (see 
sections 7.1 and 7.3 of [11]). Also, the variance of both the total number edges and 
the number of vertices of fixed degree in the SIG on X^^'^Vx is bounded below by a 
multiple of A (sections 7.1 and 7.3 of [11]). We thus obtain 0((log A)^'^A~^/^) rates 
of convergence in the CLT for the total number of edges and the number of vertices 
of fixed degree in the sphere of infiuence graph on Vx- 

3.3 Random sequential packing models 

The following prototypical random sequential packing model is of considerable sci- 
entific interest; see [12] for references to the vast literature. 

With A^(A) standing for a Poisson random variable with parameter A, we let 
Bx,i,Bx,2, ■■■,Bx,N{X) be a sequence of rf- dimensional balls of volume A~^ whose cen- 
ters are i.i.d. random rf-vectors Xi, Xn{x) with probability density function 
K : A ^ [0, oo). Without loss of generality, assume that the balls are sequenced 
in the order determined by marks (time coordinates) in [0, 1]. Let the first ball Bx,i 
be packed, and recursively for i = 2, 3, . . . , let the i-th ball Bx,i be packed iff Bx,i 
does not overlap any ball in Bx,i, ...,Bx,i-i which has already been packed. If not 
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packed, the i-th ball is discarded. 

Packing models of this type arise in diverse disciplines, including physical, chem- 
ical, and biological processes [12]. Central limit theorems for the number of accepted 
(i.e., packed) balls are established in [12, 4], whereas laws of large numbers are given 
in [13]. 

Let M. — [0, 1] with Fj^ being the uniform distribution on the unit interval. For 
any finite point set X (Z x [0,1], assume the points (x.s) G X represent the 
locations and arrival times. Assume balls of volume centered at the locations of 
X arrive sequentially in an order determined by the time coordinates, and assume as 
before that each ball is packed or discarded according to whether or not it overlaps 
a previously packed ball. Let ^{{x,s);X) be either 1 or depending on whether 
the ball centered at x at times s is packed or discarded. Consider the re-scaled 
packing functional ^x{{x, s); X) = ^{{X^^'^x, s); A^/^A"), where balls centered at points 
of X^^'^X have volume one. The random measure 

JV(A) 
i=l 

is called the random sequential packing measure induced by balls with centers arising 
from K. The convergence of the finite dimensional distributions of the packing 
measures //^ is established in [3, 4]. ^ is exponentially stabihzing [12, 3] and for 
any continuous / e S([0, 1]'') and k uniform, the variance of (/, is bounded 
below by a positive multiple of A [4], showing that (/, //^) satisfies a CLT with an 
0((log A)'^''A^^/^) rate of convergence. 

It follows easily from the stabilization analysis of [12] that many variants of 
the above basic packing model satisfy similar rates of convergence in the CLT. 
Examples include balls of bounded random radius, cooperative sequential adsorption 
([12]), and monolayer ballistic deposition ([12]). In each case the number of particles 
accepted satisfies the CLT with an 0((log A)^"'A^^/^) rate of convergence. The same 
comment applies for the number of seeds accepted in spatial birth-growth models 
[12]. 

3.4 Independence number, off-line packing 

An independent set of vertices in a graph G is a set of vertices in G, no two of which 
are connected by an edge. The independence number of G, which we denote /3(G), 
is defined to be the maximum cardinality of all independent sets of vertices in G. 

For r > 0, and for finite or countable X C M.'^, let G{X,r) denote the geometric 
graph with vertex set X and with edges between each pair of vertices distant at 
most r apart. Then the independence number f3{G{X ,r)) is the maximum number 
of disjoint closed balls of radius r/2 that can be centered at points of X; it is an 
'off-line' version of the packing functionals considered in the previous section. 
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Let 6 > be a constant, and consider the graph G{V\, h\~^/'^) (or equivalently, 
G{X^^'^Vx, b)). Random geometric graphs of this type are the subject of [9], ahhough 
independence number is considered only briefly there (on page 135). A law of large 
numbers for the independence number is described in Theorem 2.7 (iv) of [13]. 

For It > 0, let 7iu denote a homogeneous Poisson point process of intensity u on 
M*^, and let be the point process Hu with a point inserted at the origin. As on 
page 189 of [9], let Ac be the infimum of all u such that the origin has a non-zero 
probability of being in an infinite component of G{T-lu, 1). 

If ^'^ll'^lloo < Ac, we can use Corollary 2.1 to obtain a central limit theorem for 
the independence number f3{G{V\,h\^^/'^)), namely 



sup 

teiR 



(Var/3(G'(PA,&A-Vrf)))i/2 



< C(logA)3<^A-V2(3.2) 



We sketch the proof. For finite A" C R*^ and x e X, let ^{x;X) denote the 
independence number of the component of G{X, b) containing vertex x, divided by 
the number of vertices in this component. Then Xla;eA'^(-^' independence 
number of G{X,b), since the independence number of any graph is the sum of 
the independence numbers of its components. Also, our choice of ^ is translation- 
invariant, and so we obtain 

PiGiVxA-'^'b)) = (3{G{X'/''rx,b)) = ay^'^x^X^'^Vx) 

xeVx 

where we here take the test function / to be identically 1 and take Ax = A for all A. 
Thus a central limit theorem holds for /3{G{Vx: X~^^%)) by application of Corollary 
2.1, if ^ and k, satisfy the conditions for that result. 

We take R{x, A) to be the distance from X^^^x to the furthest point in the compo- 
nent containing X^^'^x of G{X^^'^Vx,b), plus 2b. Since ^x{x',Vx) is determined by the 
component of G{X^^'^Vx, b) containing X^^'^x^ and this component is unaffected by 
the addition or removal of points to/from Vx at a distance greater than X^^^'^R{x, A) 
from x, it is indeed the case that i?(a;. A) is a radius of stabilization. 

The point process X^I'^Vx is dominated by "WhkH^ (in the sense of [9], page 189). 
Hence, P[i?(x, A) > t] is bounded by the probability that the component contain- 
ing X of G{1-L\\k,\\^ U {X^^'^x}^b) has at least one vertex outside Bt_2b{^^^'^x). This 
probability does not depend on x, and equals the probability that the component of 
G(7i^d|l^ll^, 1) containing the origin includes a vertex outside B(t/b)-2- By exponen- 
tial decay for subcritical continuum percolation (Lemma 10.2 of [9]), this probability 
decays exponentially in t, and exponential stabilization of ^ follows. The moments 
condition (2.5) is trivial in this case, for any p, since < ^(x; A") < 1. 
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Thus, Corollary 2.1 is indeed applicable, provided that (2.7) holds in this case, 
with o"^ > 0. Essentially (2.7) follows from Theorem 2.1 of [4], with strict inequality 
o"^ > following from (2.10) of [4]; in the case where k is the density function 
of a uniform distribution on some suitable subset of M'^, one can alternatively use 
Theorem 2.4 of [11]. We do not go into details here about the application of results 
in this example, but we do comment further on why the distribution of the 'add one 
cost' (see [11, 4]) of insertion of a point at the origin into a homogeneous Poisson 
process is nondegenerate, since this is needed to verify cr^ > and this example was 
not considered in [11] or [4]. 

The above add one cost is the variable denoted A(oo) in the notation of [11], 
or A^(ii) in the notation of [4]. It is the independence number of the component 
containing the origin of G{l-tu\ b) minus the independence number of this component 
with the origin removed (we need only to consider the case where b'^u is subcritical). 
This variable can take the value 1, for example if the origin is isolated in G{7iu', b), 
or zero, for example if the component containing the origin has two vertices. Both 
of these possibilities have strictly positive probability, and therefore A(oo) has a 
non-degenerate distribution. 

4 Proof of Theorems 

4.1 A CLT for dependency graphs 

We shall prove Theorem 2.1 by showing that exponential stabilization implies that 
a modification of (/, 7^) has a dependency graph structure, whose definition we now 
recall (see e.g. Chapter 2 of [9]). Let X„, « G V, be a collection of random variables. 
The graph G := (V, S) is a dependency graph for Xa, a eV, if for any pair of disjoint 
sets Ai,A2 C V such that no edge in £ has one endpoint in Ai and the other in A2, 
the sigma-fields (j{Xai a e ^i}, and (j{Xai a e A2}, are mutually independent. Let 
D denote the maximal degree of the dependency graph. 

It is well known that sums of random variables indexed by the vertices of a 
dependency graph admit rates of convergence to a normal. The rates of Baldi and 
Rinott [2] and those in Penrose [9] are particularly useful; Avram and Bertsimas 
[1] use the former to obtain rate results for the total edge length of the nearest 
neighbor, Voronoi, and Delaunay graphs. 

In many cases, the following theorem of Chen and Shao [7] provides superior rate 
results. For any random variable X and any p > 0, let ||X||p = (E [|Xp])^/^. 

Lemma 4.1 (see Thm, 2.7 of [7]) Let 2 < q < 3. Let Wi, i e V, be random 
variables indexed by the vertices of a dependency graph. Let W = ^j^y W^. Assume 
that E [W^] = 1, E [Wi] = 0, and 1 1 W^l |g < ^ for allieV and for some 9 > 0. Then 

sup \P[W <t]- $(t) I < 75r>^(*-^) I Vl^". (4.1) 
t 
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4.2 Auxiliary lemmas 

To prepare for the proof of Theorem 2.1 we will need some auxiliary lemmas. 
Throughout, C denotes a generic constant depending possibly on ^, and k and 
whose value may vary at each occurrence. We assume A > 1 throughout. 

Let {px, A > 0) be a function to be chosen later, in such a way that px ^ oo and 
X'^/'^px ^ as A ^ oo. Given A > 0, let sx := A^^/^^pa, and let V := V{X) denote 
the number of cubes of the form Q — \[i=i[itSx-, {ji + ^)s\)-, with all ji e Z, such 
that Jq K{x)dx > 0; enumerate these cubes as Qi, Q2, ■ ■ ■ , Qv{X)- Since k is assumed 
to have bounded support, it is easy to see that V{X) — 0(Ap^^) as A ^ 00. 

For all 1 < i < V{X), the number of points of Vx^iQi x M) is a Poisson random 
variable Ni :— N{i/i), where 



z/j A / K{x)dx < II^IIooPa- (4-2) 



Assuming Ui > 0, choose an ordering on the points of Vx H {Qi x M.) uniformly at 
random from all (iVj)! possible such orderings. Use this ordering to list the points 
as (Xj 1, f/j^i), (XjjVj, f/j jv.), where conditional on the value of Ni, the random 
variables Xij, j = 1,2,... are i.i.d. on Qi with a density /tj(-) := Jq K{x)dx, 
and the Uij are i.i.d. in Ai with distribution independent of {Xij}. Thus 
we have the representation Vx — uJ^^\{Xij, Uij)}^^-^^. For all 1 < i < ^(A), let 
Vi :— Vx \ {{Xij, and note that Vi is a Poisson point process onM.'^ x M 

with intensity XK{x)l^d\Q.{x)dx x ¥_M{ds). 

We show that the condition (2.5), which bounds the moments of the value of ^ 
at points inserted into Vx, 

also yields bounds oiaE[\^x{{Xij,Uij);Vx)\^iAx{^i,j)^j<Ni]- More precisely, we 
have: 

Lemma 4.2 // (2.5) holds for some p > 0, then there is a constant C := C{p) such 
that for all A > 1, all j > I and 1 < i < V{X), 

E [\UXij; Vx) ■ lA,{X,,,)l,<Nf] < Cpi (4.3) 

Proof. If Ni = n, then denote {(Xj_i, C/j,i), {Xi^^i, t^i.ArJ} by By definition, 

00 „ 

E[\u{Xi,j,Ui,jy,VxyiA,{Xij)ij<^^\p] = J2 / E[\u{x,uy,x^_,\jVi)r]Ki{x)dx-Pi 

n=j -iQi^A^ 

where the expectation on the right hand side is with respect to C/, Xn-i and Vi- 
The above is bounded by 



<l^^J2 E[\uix,uyxn-iuViyp]Ki{x)dx 
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= 1/,^]/ E[\U{x,uy,rxW I \rxn{QixM)\=m]K^{x)dx■P[\Vxn{Q^xM)\ 

f ¥.[\^x{{x,U)-Vx)\'']Ki{x)dx < const, x i/^, 
where the last inequahty follows by (2.5). By (4.2), this shows (4.3). □ 
For 1 <i <V, and for j e {1, 2, . . .}, we define 

r G((X,,„ U,,)- Vx) if iV. > J, X,, e Ax 
' \ otherwise 

Lemma 4.3 // (2.5) holds for some p > q > 1, then there exists C := C{p, q) such 
that for l<i< V{X), 



3=1 



(4.4) 



Proof. Fix i < V{\) and write for ^jj. Clearly, with N :— Ni and u :— i/i, 

oo / oo ^ 



i=i 



A;=0 



< 



EE 161 

j=l k=0 



E 161 • 1 



Since a.s. only finitely many summands in the double sum are non-zero, by subad- 
ditivity of the norm, the above is bounded by 



=^E 

fe=0 



oc 

sE 

k=0 



E 161- 



EI6I-1 



+ 



(4.5) 



^ XI XI ll^j • ^N>2'^u\\n + llCj • lAr<i.||„ , 

A:=0 j=l j=l 

where here and elsewhere lx\ denotes the greatest integer less than or equal to x. 
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With r) :— (l/q) — Holder's inequality followed by (4.3) yields 

lie, • W.ll. < lle.llp • {P[N > < Cpf (P[iV > 2'=^)". (4.6) 

Substituting into (4.5) we obtain 



k=0 3=1 



(4.7) 



By tail estimates for the Poisson distribution (see e.g. (1.12) in [9]), since < 8 we 
have 

{P[N > 2^u])" < (exp (-2'=-Vlog(2'=)))'' = exp(-fc2^-^r?z/log 2), k>3. 
Hence, 

{P[N > 2^])" < 2-2^ k > max(3, 2 - log^ivi^)). 
Hence, since 77 is a constant, for > we have 



J22''+' ■ {P[N >2''u]y < E 

fc<max(3,2— log2{r;j^)) 



2fc+i 



k=0 



fe>max(3,2— log2(»7i')) 



< Cmax(l/z/, l) + 2 

where C does not depend on u. Thus by (4.2), the first term in the right hand 
side of (4.7) is 0{p'^'^^^). Also, the second term in the right hand side of (4.7) is 



Oipt^'^h by Lemma 4.2. Hence, (4.7) implies (4.4). 



□ 



4.3 Proof of Theorems 2.1 and 2.2 

We prove Theorems 2.1 and 2.2 together. When proving Theorem 2.1 we assume 
that ^ is exponentially stabilizing and (2.5) holds for some p > 2, and wc choose 
q E (2,3] with q < p. When proving Theorem 2.2 we assume that ^ is polynomially 
stabihzing of order 7 with 7 > d(150 + 6/p), and that (2.5) holds for some p > 3, 
and we set q = 3. 

Throughout this section, we fix / G B{A) and set Tx := (/, yU^). We follow 
the setup of the preceding section, with the support of k, covered by cubes of side 
A-^/V, and we now choose px- With the tail probabihty r{t) defined at (2.2), we 
choose px in such a way that there is a constant C such that for all A > 1, 

Pf (Ar(pA))(''-'^/('''^ < CX-^ and t{px) < CX'^ (4.8) 

and also 

pi < CAP/(P+'\ (4.9) 
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In the exponentially stabilizing case (Theorem 2.1) we achieve this by taking px — 
a log A for some suitably large constant a. In the polynomially stabilizing case of 
order 7 (Theorem 2.2), we take p\ — CA" with 

25p /7 d\ 25 ^ 

^ so a - - - = — . (4.10) 



— 6d \ 6 p J 6 

which implies that (4.8) holds with g = 3, and that (4.9) holds (to obtain the last 
conclusion we use our assumption on 7, which implies 7 > d{25 + 56/p).) 

For all 1 < i < and all j = 1,2,..., let Rij denote the radius of stabilization 
of ^ at {Xij, Uij) if 1 < J < A^i and Xjj G Ax; let Rij be zero otherwise. 

Let Eij := {Rij < px}- Let Ex := n]li n°^i Eij Then by Markov's inequality 
and standard Palm theory (e.g. Theorem 1.6 in 



P[El] < E 



■ V Ni 

.i=i j=i 



= A / P[R{x, A) > px\K{x)dx < Xt{px). (4.11) 

J Ax 



For each A, and x e R'^, set fx{x) :— f{x)lAx{x)- Recalling the representation 



Vx = urj?^{X,,,};ii, we have 

V(X) 

= $]$^6((^m,^m);^a) •/a(^^,,). 

i=l j=l 

To obtain rates of normal approximation for T\, it will be be convenient to consider 
a closely related sum enjoying more independence between terms, namely 

V(A) JV, 

^^^((X,,, [/,,); P,) • lE^^^.fxiX,,). 
i=i j=i 

For all 1 < i < V{X) define 

Ni 

S, := % (Varrj;)-V2 ^ ^^^^^^^.^ . ^^^^^ . ^^^^^^^.^ 

and put 5 := {VaiT;^)-^/\T{ - ETj;) = E«=?H'5i - E^^). Clearly Var5 = E^^ _ 1. 

We define a graph Gx ■— (Va,^a) as follows. The set Va consists of the sub- 
cubes Qi, Qv{\) and edges (Qj, Qj) belong to £x if d{Qi, Qj) < 2aX~^^^px, where 
d{Qi,Qj) := inf{|,x — y\ : x & Qi,y € Qj} ■ By definition of the radius of sta- 
bilization R{x, \), the value of Si is determined by the restriction of Vx to the 
A^^/'^PA-neighborhood of the cube Qi. By the independence property of the Poisson 
point process, if Ai and A2 are disjoint collections of cubes in Va such that no edge 
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in £\ has one endpoint in Ai and one endpoint in A2, then the random variables 
{SqitQi e Ai} and {Sq.^Qj e ^2} are independent. Thus Gx is a dependency 

graph for {Si]'(Sl\ 

To prepare for an apphcation of Lemma 4.1, we make five observations: 

(i) V{\) := |Va| = olXp^'^) as A ^ 00. 

(ii) Since the number of cubes in Qi, Qv distant at most 2\''^/'^p\ from a given 
cube is bounded by 5*^, it follows that the maximal degree D satisfies D :— Dx < 5"^. 

(ni) The definitions of Si and and Lemma 4.3 tell us that for all 1 < i < V{X) 



(iv) We can bound VarfTj^] as follows. Observe that T_( is the sum of V{X) 
random variables, which by the case g = 2 of Lemma 4.3 each have a second moment 
bounded by a constant multiple of p^'^'^+^^Z^. Thus the variance of each of the V^(A) 
random variables is also bounded by a constant multiple of p^'^^^+^^Z^. Moreover, 
the covariance of any pair of the V{X) random variables is zero when the indices of 
the random variables correspond to non-adjacent cubes. For adjacent cubes, by the 
Cauchy-Schwarz inequality the covariance is also bounded by a constant multiple of 

VarK] = 0(pf^+')/^A). (4.13) 

(v) Var[T(] is close to Var[TA] for A large. We require more estimates to show 
this. Note that |T( — Tx\ =0 except possibly on the set E^. Lemma 4.3, along with 
Minkowski's inequality, yields the upper bound 



V{\) Ni 

J2T.\^xiiXij,Uij);Vx)\lA,{X.,] 

i=i j=i 



< Cl^(A)pf < CApf^ (4.14) 



Since Tx — Tj^ on event Ex, the Holder and Minkowski inequahties yield 

\\Tx -n\\2< \\Tx - Tj;||,P[E^](i/2)-(i/^) 
<{\\Tx\U+\\T'x\U)P[El](^-'^/^'^l 

Hence, by (4.8), (4.11), and (4.14), 

||Ta - Tj;||2 < CXpf^iXTipx))^"-^^^^^"^ < CA-3 (4.15) 
which implies that 

E[\T'^-Tx\]<CX-^ (4.16) 
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which we use later. Since 

Var[T,] = VarK] + Var(T, - T^) + 2Cov(Tj;, T, - T^), 
by (4.15), (4.13), (4.9) and the Cauchy-Schwarz inequahty, we obtain 

|Var[r;,]-VarK]| <CA-^ (4.17) 

Given the five observations (i)-(v), we are now ready to apply Lemma 4.1 to 
prove Theorem 2.1. By (4.13) and (4.17), Var[rx], as a function of A, is bounded 
on bounded intervals. Hence, it suffices to prove that there exists Aq > 2 such that 
(2.6) holds for all A > Aq, since we can then extend (2.6) to all A > 2 by changing 
C if necessary. Trivially, (2.6) holds for large enough A when Var[TA] < 1, and so 
without loss of generality we now assume Var[T)^] > 1. To establish the error bound 
(2.6) in this case, we apply the bound (4.1) to Wi :— Si — 'ESi, 1 <i < Vx, with 

Our choice of ^ is applicable by (4.12). We clearly have E [W,] = and E [(E2?^ Wif] 
1. With S — Y^=\^ Lemma 4.1 along with observation (i) above yields 

sup \P[S <t\- $(t)| < CXpl^VaxT'^Y'^l^p^^''^''^''' 
t 

< C\{YwTxy^'^p^, (4.18) 

where the last line makes use of the fact that Var[T(] > Var[Tx]/2, which follows 
(for A large) from (4.17). 

Now if /3 > is a constant and Z any random variable then by (4.18) we have 
for alH e M 

P[Z <t\<P[S <t + p]+ P[\Z -s\>p] 
< ^{t + /?) + CA(VarrA)-«/V? + P[\Z -S\>(5\ 
< ^t) +Cf3 + CA(Varr;,)-«/V? + P[\Z - S\ > f3] 
by the Lipschitz property of Similarly for all i e M, 

P[Z <t]> $(t) -Cf3- CA(VarTA)-'^/V? - P[\Z - S\ > (3]. 
In other words 

sup \P[Z <t]- $(t)| < C/3 + CA(VarrA)-^/Vf + P[\Z ~ S\ > f3]. (4.19) 
t 

Now by definition of S, 

|(VarTj;)-V2(T, - ET,) - S\ = |(VarT{)-V2((T, - ET,) - {T', - ET{)}| 
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<(Varn)-V2{|r,-n|+E[|T,-T;|]} 

which by (4.16) is bounded by CX"^ except possibly on the set which has proba- 
bihty less than C\-^ by (4.11) and (4.8). Thus by (4.19) with Z = (Varr()-V2(r^_ 
ET^) and/? = CA-3 

sup \P[{YaxT'^)-'/\T^ -ET^)<t]- ^t)\ < CA(VarrA)"'/V? + CX'^ (4.20) 
t 

Moreover, by the triangle inequality 

sup |P[(VarTA)-^/'(TA - ETa) < t] - $(t)| < 



< sup 
t 



(VarTj;)-i/2(T, - ET,) < t ■ 



VarTj; 



+ sup 

t 



,VarT; 



VarTAxi/2 
VarTj;^ 



+ 



*(.(^)-)-*W 



(4.21) 



Since for all s < t, we have |$(s) — $(t)| < (t — s) ma.Xs<u<t 4>{u) where denotes 
the standard normal density, and since by (4.17) there is a constant < C < oo 
such that for all A > and alH e IR 

C\t\ 



.VarT;^, 1 /9 



< \t\ 


VarTA 


< 




VarT( 





we get 



sup 
t 



VarT^ 



A 



< C sup 
t 



\t\ 

A2 



A2 



max (h(u 
ue[t-tc/x'2, t+tc/x"^] 



< 



C 
A2' 



Thus by (4.20) and (4.21), 

sup \P[{Va.rTx)-'/\Tx - ET^) <t]- $(t)| < CA(VarTA)-^/V? + CX' 



(4.22) 



Finally we can deduce from (4.13) and (4.17) that VarTx = 0{Xp 



d{p+2)/p^ 
A y 



. Hence, 

under the assumptions of Theorem 2.1, in (4.22) the first term in the right hand 
side dominates, thus yielding the desired bound (2.6), and the proof of Theorem 2.1 
is complete. 

Under the assumptions of Theorem 2.2, provided (T^(/, ^,«;) > the right hand 
side of (4.22) is bounded by CA~^/^p^^, and since in this case we set px = X'^ 
with a given by (4.10), some elementary algebra yields (2.9). Our assumption that 
7 > (i(150 + 6/p) then yields the central limit theorem behavior (2.8), which is also 
trivially true in the case where (7^(/, ^, k) = 0. This completes the proof of Theorem 
2.2. □ 
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